Reward Prediction Error Signals are Metarepresentational
نویسنده
چکیده
Although there has been considerable debate about the existence of metarepresentational capacities in non-human animals and their scope in humans, the well-confirmed temporal difference reinforcement learning models of rewardguided decision making have been largely overlooked. This paper argues that the reward prediction error signals which are postulated by temporal difference models and have been discovered empirically through single unit recording and neuroimaging do have metarepresentational contents.
منابع مشابه
Reward prediction error signals by reticular formation neurons.
As a key part of the brain's reward system, midbrain dopamine neurons are thought to generate signals that reflect errors in the prediction of reward. However, recent evidence suggests that "upstream" brain areas may make important contributions to the generation of prediction error signals. To address this issue, we recorded neural activity in midbrain reticular formation (MRNm) while rats per...
متن کاملPanel Session What Does Dopamine Say: Clues from Computational Modeling
Background: Reinforcement learning models now play a central role in modern attempts to understand how the brain categorizes and values events traditionally framed by psychology as rewards and punishments. These models provide a way to design and interpret of reward expectancy experiments in humans across a wide range of rewarding dimensions. They also provide a connection to computational mode...
متن کاملNeural dynamics of reward probability coding: a Magnetoencephalographic study in humans
Prediction of future rewards and discrepancy between actual and expected outcomes (prediction error) are crucial signals for adaptive behavior. In humans, a number of fMRI studies demonstrated that reward probability modulates these two signals in a large brain network. Yet, the spatio-temporal dynamics underlying the neural coding of reward probability remains unknown. Here, using magnetoencep...
متن کاملReward prediction error computation in the pedunculopontine tegmental nucleus neurons.
In this article, we address the role of neuronal activity in the pathways of the brainstem-midbrain circuit in reward and the basis for believing that this circuit provides advantages over previous reinforcement learning theory. Several lines of evidence support the reward-based learning theory proposing that midbrain dopamine (DA) neurons send a teaching signal (the reward prediction error sig...
متن کاملCognitive Strategies Regulate Fictive, but not Reward Prediction Error Signals in a Sequential Investment Task
Computational models of reward processing suggest that foregone or fictive outcomes serve as important information sources for learning and augment those generated by experienced rewards (e.g. reward prediction errors). An outstanding question is how these learning signals interact with top-down cognitive influences, such as cognitive reappraisal strategies. Using a sequential investment task a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Nous
دوره 48 2 شماره
صفحات -
تاریخ انتشار 2011